163 research outputs found
The Chlamydomonas genome project: A decade on
The green alga Chlamydomonas reinhardtii is a popular unicellular organism for studying photosynthesis, cilia biogenesis, and micronutrient homeostasis. Ten years since its genome project was initiated an iterative process of improvements to the genome and gene predictions has propelled this organism to the forefront of the omics era. Housed at Phytozome, the plant genomics portal of the Joint Genome Institute (JGI), the most up-to-date genomic data include a genome arranged on chromosomes and high-quality gene models with alternative splice forms supported by an abundance of whole transcriptome sequencing (RNA-Seq) data. We present here the past, present, and future of Chlamydomonas genomics. Specifically, we detail progress on genome assembly and gene model refinement, discuss resources for gene annotations, functional predictions, and locus ID mapping between versions and, importantly, outline a standardized framework for naming genes
An integrated computational pipeline and database to support whole-genome sequence annotation
We describe here our experience in annotating the Drosophila melanogaster genome sequence, in the course of which we developed several new open-source software tools and a database schema to support large-scale genome annotation. We have developed these into an integrated and reusable software system for whole-genome annotation. The key contributions to overall annotation quality are the marshalling of high-quality sequences for alignments and the design of a system with an adaptable and expandable flexible architecture
Recommended from our members
Apollo: a sequence annotation editor
The well-established inaccuracy of purely computational methods for annotating genome sequences necessitates an interactive tool to allow biological experts to refine these approximations by viewing and independently evaluating the data supporting each annotation. Apollo was developed to meet this need, enabling curators to inspect genome annotations closely and edit them. FlyBase biologists successfully used Apollo to annotate the Drosophila melanogaster genome and it is increasingly being used as a starting point for the development of customized annotation editing tools for other genome projects
Recommended from our members
Sequencing wild and cultivated cassava and related species reveals extensive interspecific hybridization and genetic diversity
Cassava (Manihot esculenta) provides calories and nutrition for more than half a billion people. It was domesticated by native Amazonian peoples through cultivation of the wild progenitor M. esculenta ssp. flabellifolia and is now grown in tropical regions worldwide. Here we provide a high-quality genome assembly for cassava with improved contiguity, linkage, and completeness; almost 97% of genes are anchored to chromosomes. We find that paleotetraploidy in cassava is shared with the related rubber tree Hevea, providing a resource for comparative studies. We also sequence a global collection of 58 Manihot accessions, including cultivated and wild cassava accessions and related species such as Ceará or India rubber (M. glaziovii), and genotype 268 African cassava varieties. We find widespread interspecific admixture, and detect the genetic signature of past cassava breeding programs. As a clonally propagated crop, cassava is especially vulnerable to pathogens and abiotic stresses. This genomic resource will inform future genome-enabled breeding efforts to improve this staple crop
The Cassava Genome: Current Progress, Future Directions
The starchy swollen roots of cassava provide an essential food source for nearly a billion people, as well as possibilities for bioenergy, yet improvements to nutritional content and resistance to threatening diseases are currently impeded. A 454-based whole genome shotgun sequence has been assembled, which covers 69% of the predicted genome size and 96% of protein-coding gene space, with genome finishing underway. The predicted 30,666 genes and 3,485 alternate splice forms are supported by 1.4 M expressed sequence tags (ESTs). Maps based on simple sequence repeat (SSR)-, and EST-derived single nucleotide polymorphisms (SNPs) already exist. Thanks to the genome sequence, a high-density linkage map is currently being developed from a cross between two diverse cassava cultivars: one susceptible to cassava brown streak disease; the other resistant. An efficient genotyping-by-sequencing (GBS) approach is being developed to catalog SNPs both within the mapping population and among diverse African farmer-preferred varieties of cassava. These resources will accelerate marker-assisted breeding programs, allowing improvements in disease-resistance and nutrition, and will help us understand the genetic basis for disease resistance
The \u3cem\u3eChlamydomonas\u3c/em\u3e Genome Reveals the Evolution of Key Animal and Plant Functions
Chlamydomonas reinhardtii is a unicellular green alga whose lineage diverged from land plants over 1 billion years ago. It is a model system for studying chloroplast-based photosynthesis, as well as the structure, assembly, and function of eukaryotic flagella (cilia), which were inherited from the common ancestor of plants and animals, but lost in land plants. We sequenced the ∼120-megabase nuclear genome of Chlamydomonas and performed comparative phylogenomic analyses, identifying genes encoding uncharacterized proteins that are likely associated with the function and biogenesis of chloroplasts or eukaryotic flagella. Analyses of the Chlamydomonas genome advance our understanding of the ancestral eukaryotic cell, reveal previously unknown genes associated with photosynthetic and flagellar functions, and establish links between ciliopathy and the composition and function of flagella
Recommended from our members
Annotation of the Drosophila melanogaster euchromatic genome: a systematic review
BACKGROUND: The recent completion of the Drosophila melanogaster genomic sequence to high quality and the availability of a greatly expanded set of Drosophila cDNA sequences, aligning to 78% of the predicted euchromatic genes, afforded FlyBase the opportunity to significantly improve genomic annotations. We made the annotation process more rigorous by inspecting each gene visually, utilizing a comprehensive set of curation rules, requiring traceable evidence for each gene model, and comparing each predicted peptide to SWISS-PROT and TrEMBL sequences. RESULTS: Although the number of predicted protein-coding genes in Drosophila remains essentially unchanged, the revised annotation significantly improves gene models, resulting in structural changes to 85% of the transcripts and 45% of the predicted proteins. We annotated transposable elements and non-protein-coding RNAs as new features, and extended the annotation of untranslated (UTR) sequences and alternative transcripts to include more than 70% and 20% of genes, respectively. Finally, cDNA sequence provided evidence for dicistronic transcripts, neighboring genes with overlapping UTRs on the same DNA sequence strand, alternatively spliced genes that encode distinct, non-overlapping peptides, and numerous nested genes. CONCLUSIONS: Identification of so many unusual gene models not only suggests that some mechanisms for gene regulation are more prevalent than previously believed, but also underscores the complex challenges of eukaryotic gene prediction. At present, experimental data and human curation remain essential to generate high-quality genome annotations
Domestication syndrome is investigated by proteomic analysis between cultivated cassava (Manihot esculenta Crantz) and its wild relatives
Cassava (Manihot esculenta Crantz) wild relatives remain a largely untapped potential for genetic improvement. However, the domestication syndrome phenomena from wild species to cultivated cassava remain poorly understood. The analysis of leaf anatomy and photosynthetic activity showed significantly different between cassava cultivars SC205, SC8 and wild relative M. esculenta ssp. Flabellifolia (W14). The dry matter, starch and amylose contents in the storage roots of cassava cultivars were significantly more than that in wild species. In order to further reveal the differences in photosynthesis and starch accumulation of cultivars and wild species, the globally differential proteins between cassava SC205, SC8 and W14 were analyzed using 2-DE in combination with MALDI-TOF tandem mass spectrometry. A total of 175 and 304 proteins in leaves and storage roots were identified, respectively. Of these, 122 and 127 common proteins in leaves and storage roots were detected in SC205, SC8 and W14, respectively. There were 11, 2 and 2 unique proteins in leaves, as well as 58, 9 and 12 unique proteins in storage roots for W14, SC205 and SC8, respectively, indicating proteomic changes in leaves and storage roots between cultivated cassava and its wild relatives. These proteins and their differential regulation across plants of contrasting leaf morphology, leaf anatomy pattern and photosynthetic related parameters and starch content could contribute to the footprinting of cassava domestication syndrome. We conclude that these global protein data would be of great value to detect the key gene groups related to cassava selection in the domestication syndrome phenomena
- …